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Abstract 

It remains an open question whether the apparent additional power 
of quantum computation derives inherently from quantum mechanics, or 
merely from the flexibility obtained by "lifting" Boolean functions to lin- 
ear operators and evaluating their composition cleverly. Holographic al- 
gorithms provide a useful avenue for exploring this question. We describe 
a new, simplified construction of holographic algorithms in terms of Pfaf- 
fian circuits. Novel proofs of some key results are provided, and we extend 
the approach of [51] to nonsymmetric, odd, and homogenized signatures, 
circuits, and various models of execution flow. This shows our approach 
is as powerful as the matchgate approach. Ifolographic algorithms pro- 
vide in general 0(n"'') time algorithms, where Up is the order of Pfaffian 
evaluation in the ring of interest (with 1.19 < Up < 3 depending on the 
ring) and n is the number of inclusions of variables into clauses. Our ap- 
proach often requires just the evaluation of an n x n Pfaffian, and at most 
needs an additional two rows per gate, whereas the matchgate approach 
is quartic in the arity of the largest gate. We give examples (even before 
any change of basis) including efficient algorithms for certain lattice path 
problems and an 0(n'^^) algorithm for evaluation of Tutte polynomials 
of lattice path matroids. Finally we comment on some of the geometric 
considerations in analyzing Pfaffian circuits under arbitrary basis change. 
Connections are made to the sum-product algorithm, classical simulation 
of quantum computation, and SLOCC equivalent entangled states. 

1 Introduction 



Valiant's (48; matchgates give a variety of linear operators on two qubits which 
can be composed according to certain rules into matchcircuits, thereby simulat- 
ing certain quantum computations in polynomial time. An alternative formula- 
tion into planar matchgrids is also available [49 , and is equivalent P3] • In this 
paper we detail a third formulation. If a circuit is built from matchgates, and 
the input and any bit of the output is fixed, the probability of observing that bit 
can be computed in classical polynomial time. Valiant's two-input, two-output 
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matchgates are characterized by two linear operators acting independently on 
the even and odd parity subspaces of a pair of qubits, with the restriction that 
the operators have the same determinant. More generally matchgates are char- 
acterized by the vanishing of polynomials called matchgate identities. These 
gates correspond to linear optics acting on non-interacting fermions with pair- 
wise nearest neighbor gates and one-bit gates gSl |30] ■ The theory of matchgates 
has been further developed by Cai et al., who explained the relation to tensor 
contractions [T7j . 

We describe a framework for constructing and analyzing holographic algo- 
rithms which is equivalent to the matchgate formulation, but which does not 
use matchgates. This builds upon our results in j34| . The main contributions 
of the present work are as follows. Theorem 14.41 gives a new, explicit PfafSan 
ordering together with a proof of its validity. Section 15.21 extends our construc- 
tion [34] to asymmetric, odd, and homogenized signatures, which shows that 
it is as powerful as the matchgate approach. We then analyze the cost of this 
extension. Given a #CSP instance with n inclusions of variables into clauses, 
our construction often requires just the evaluation of the Pfaffian of an n x n 
matrix. Theorem 15.51 states that in the worst case, the extension requires an 
additional two matrix rows per homogenized predicate. Section r5.3l derives some 
new algorithms for lattice path enumeration using these results. Finally, Theo- 
rem [Hil] gives the first result on holographic algorithms with heterogeneous basis 
change: all arity-three predicates are implementable. 

In Section [2l we give necessary background, definitions, and notation. In 
Section [3l we describe the tensor contraction circuit formalism for readers un- 
familiar with it. In Section |4l we introduce Pfaffian circuits and the method 
for evaluating them in polynomial time. In Section [5l we analyze simple and 
compound Pfaffian predicates in the standard basis, which all holographic com- 
putations are reduced to before evaluation, and relate them to spinor varieties. 
We also derive some algorithms for lattice path enumeration problems as exam- 
ples. Finally in Section IH] we comment on some of the geometric considerations 
in analyzing Pfaffian circuits under arbitrary basis change. 

2 Preliminaries 

It is by now well known [31[TTl[T71[21inZll3HlllIlllH] that the counting version 
of a Boolean constraint satisfiability problem (#CSP) can be expressed as a 
tensor contraction. Moreover, such tensor contraction networks or circuits serve 
as good general models for computation capable of being specialized to classical 
Boolean circuit computation, quantum computing, and other settings. Restrict- 
ing the set of tensor circuits allowed in various ways, one can accurately capture 
several complexity classes including Boolean circuits, BQP, #P, etc. [22]. 

It is an open question whether the additional power of the quantum compu- 
tational models comes inherently from the special features of quantum mechan- 

^An example of how a quantum computation can be viewed as a #:CSP tensor contraction 
is the following. All the preparations and operations are naturally expressed in terms of tensor 
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ics, or merely from the additional freedom which comes from "lifting" Boolean 
functions by viewing them instead as linear operators. After such a lifting, we 
may study the complete computation as a tensor contraction, which may have 
a surprisingly efficient algorithm to evaluate. Indeed, this has been a known 
open avenue to a possible (full or even partial [Hill]) collapse of the polynomial 
hierarchy. 

2.1 Tensors and predicates 

Let yi, . . . , be 2-dimensional vector spaces and F^^, l/^v^ _ _ ^ yny ^j^gj^. 
duals. Fix a basis {vq,v\\ for each T^'; this will be called the standard basis 
(e.g. in quantum computing, these could represent choices of orthogonal bases). 
Let {i/Q, ul} be the dual basis of V" , so v^j{v],) = 5jk- Fixing an order of the 
we denote an induced basis element oi ® V"^ ® ■ ■ ■ ® V" by a bitstring such 
as 

|0100 • • • 10) = "f^l «) (g) • • • (g) W""^ (g) Vq 

and e.g. (00111 • • • 10| (n bits) for an element of the induced dual basis. Note 
that we omit the scripts identifying the vector spaces when the correspondence 
intended is clear from context. When we need to keep track of the vector spaces 
involved, we write 

IO1I4O7) for vl(S)vf(»vl. (1) 

A bitstring x is sometimes abbreviated by the set of ones in the bitstring; e.g. 
we write | J), with J — {3,4} for the induced basis element |0011) when n — A. 

A Boolean predicate or relation is a formula in a Boolean algebra (e.g. p =^ 
q, py q), i.e. a truth table. By designating some variables as inputs and others 
outputs, such that all possible inputs have a unique output, a predicate can also 
be viewed as a function or gate. If there are more than one output, it could 
be viewed as a nondeterministic gate; with an input missing, a gate capable of 
terminating some computation paths. 

With the bitstring bra-ket notation, it is natural to express Boolean pred- 
icates and functions in the same terms as arbitrary linear transformations or 
tensors. A predicate becomes the formal sum of the rows of its truth table 
as bitstrings. For example OR3 = (|0) + - |000). Thus any Boolean 

predicate can be viewed as a multilinear operator. Thus any Boolean pred- 
icate is a multilinear operator. When we want to think of, e.g. AND2,i = 
|00,0) -|- |01,0) -|- |10, 0) -|- |11,1) as a gate or function accepting two bits as 
input and outputting another, we separate the input and output by a comma. 
However nothing has really changed and this is just to aid in reasoning about a 
circuit when we orient its edges (see Figure [1]). Viewed as linear operators in an 
explicit basis, predicates may be composed by matrix multiplication. Tensors 

products and contractions; indeed a quantum circuit diagram is a type of tensor contraction 
network. Suppose n input bits are placed in uniform superposition. A unitary operator is 
applied which represents a CSP and outputs many garbage bits and one satisfiable-or-not bit 
s. Then the probability of observing s = l is the number of satisfying assignments divided by 
2". 
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with a mix of primal and dual vector spaces have mixed ality. Say IO1O2I3) is 
a (3) ality tensor and (I4I5I6I7I is a (q) tensor, |0i02l3)(l4l5l6l7| is (3)- If 
instead we have (O2I3I4I5I, the partially contracted (O2I3I4I5IO1O2I3) is a (^) 
tensor equal to |Oi)(l'*l^|. A Q tensor (e.g. (O1O2I3IO1O2I3) = 1) is a scalar. 
A gate or n-gate is a (^) tensor of this type (e.g. OR3, AND2^i), and a cogate 
or n- cogate is a (g) tensor. A general (^) tensor is a predicate. The arity of an 
(J^) predicate is n + m, the tensor degree or number of edges incident when the 
predicate is included in a circuit. 

We can define a restricted class 3^ of allowed predicates. Formally these are 
a (possibly infinite) collection of unlabeled predicates |/) which map subsets of 
[n] of size |/| to labeled predicates, in a computation of size n. These generate 
a space of possible circuits or tensor formulae called compound predicates by 
tensor product and partial contraction. A generating set of predicates which 
enables us to construct all Boolean functions is said to be universal. Because 
swap and fanout are not always included in such a generating set, we cannot 
simply appeal to results such as Post's lattice [l^ to analyze the space of circuits 
computable by a given generating set of predicates. Worse, for holographic 
algorithms there are further restrictions on how available predicates can be 
composed to form circuits. In some cases, merely determining whether one 
predicate is implementable in terms of another can be undecidable |21j . 

Theorem 2.1 (|21j). Without Janout, there is no general procedure for deciding 
the following question: Given a predicate X, can a graph of X's he built that 
implements a given desired predicate Y ? 

The study of holographic algorithms and related questions is in large part 
concerned with studying certain restricted families of predicates, subject to 
additional composition rules, and asking what the resulting space of circuits 
can compute. The result is an interplay between the algebraic geometry which 
describes the space of allowed predicate families and the combinatorial and 
computational complexity theory needed to analyze the power of compositions 
to compute useful things. 

In the language of universal algebra [M], which has been central to the 
recent #CSP dichotomy theorem over finite relational structures (i.e. for 
combinatorially unrestricted circuits of unliftcd predicates), our construction 
can be described as follows. Let A be a finite alphabet and A" the set of all 
n-tuples of elements of A or n-ary relations over A. Let Ra = U„gN^" be 
the set of all finitary relations over A. A constraint language, or restricted 
class of predicates L is subset of Ra- The set of implementable predicates is 
characterized by a closure with respect to invariants. 

Theorem 2.2 (j4l I25j). Without variable or combinatorial restrictions (i.e. 
with the presence of fanout and swap ), the set of predicates implementable by 
compositions of a given set of predicates L is the clone of L, Inv(Pol(_L)). 

This yields a Galois correspondence between Ra and sets of finitary oper- 
ations on A analogous, for example, to the correspondence between ideals and 
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algebraic varieties. See [5J [T3] for details. Denote by CRa the tensor al- 
gebra built from the vector space CA spanned by the elements of A. This is 
just the noncommutative ring over C, whose monoid has elements n-tuples of 
elements of A and semigroup operation tuple concatenation. There is a natural 
embedding which we call lifting of Ra into CRa which sends an n-tuple of A 
to the corresponding induced basis vector. We have the additional wrinkle of 
putting variables and constraints on the same footing, so that we need both a 
constraint language L C Ra and a variable language V C Ra (in fact lifted 
versions of both). In this paper we restrict attention to the binary case |yl| = 2. 
We also note that this lifting can be profitably expressed in terms of functors 
of monoidal categories, but postpone details of this approach to future work. 

Given such a restricted class of predicates, introducing a change of basis 
expands, or more properly rephrases, the set of predicates available. To the 
extent the rephrasing allows us to work with familiar Boolean predicates or close 
relatives, this makes the computational complexity aspect more manageable. 
However the two predicates connected by a shared vector space, one with the 
primal and one with the dual, must use the same basis on that vector space. 
Suppose A is the change of basis on such a vector space V, with 

|0> |1) 
^ _ |0) f floo aio 



1) \aQi ail 

so that A : |0) H> aoo|0) -|-aoi|l) and |1) ^ aio|0) +aii|l). Then the dual 
change of basis applying (A^^)^ on is A'^ : (0| (det (aii(0| - 
aio(l|) and (1| {detA)-^{ - aoi(0| + aoo(l|). We can see that 1 = (0|0) = 
(A^((0|),A(|0))) = (det A)-i(aiiaoo(0|0) - aioaoi(l|l)) = 1 and similarly for 
(1|1); moreover (0|1) = (0|1) = 0. We record this observation as follows. 

Proposition 2.3. Applying a change of basis A to any vector space factor of 
the tensor space of a contraction and to its dual does not affect the value of 
the pairing. 

However, applying a change of basis on each vector space of a pairing may 
enable us to simplify its evaluation. The process is somewhat analogous to sim- 
plifying or decomposing a matrix or tensor to a more manageable form by Givens 
rotations. Here the "rotations" serve to translate between predicates which are 
convenient to reason about in computational complexity terms (e.g. Boolean 
predicates) and predicates whose composition can be evaluated efficiently. 

A very useful change of basis is the Hadamard change of basis 



H = 



1 1 

1 -1 



mapping |0) i-^ |0) + |1) and |1) ^-> |0) - |1); we have i?^ = ^H. We avoid the 
more natural self-dual ^/2H only to maximize the opportunity for exact integer 
arithmetic. In most cases any constants can be ignored until organizing the final 
computation. 
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Example 2.4. The Not- All-Equal clause is true if the variables are not all equal 

7Vyl£^3 ^ |001) + |010> + |100) + |011) + |101) + |110). 

Under the Hadamard change of basis it equals 

H®^NAE-i = 6|000) - 2|011) - 2|101) - 2|110), 

which we will see makes it Pfafhan, and hence tractable to compute with under 
planarity. Note that the basis-changed version only makes sense in Ci?yi and 
not Ra- 

3 Tensor contraction circuits 

A circuit is a composition of predicates that computes something of interest. 
Given a collection of (co)gates (or predicates), we may diagram their contraction 
by means of a bipartite graph which shows which pairing to make. 

Definition 3.1. A tensor contraction circuit (or simply circuit) F is a combi- 
natorial object consisting of 

(i) a set 3^ of tensor predicates with coefficients in a field F, the gadgets we 
may use to build the circuit, and 

(ii) a graph F = (Predicates, Edges) such that each predicate in Predicates 
is drawn from each edge of a predicate corresponds to one of its vec- 
tor spaces, and two predicates sharing an edge have dual vector spaces 
corresponding to that edge. 

This F represents the circuit architecture: which vector spaces are paired, in- 
cluding the assignment for non-symmetric predicates. 

The value val(F) of a circuit is the result of the indicated maximal tensor 
contraction. A circuit is closed if the contraction is total: all vector spaces 
appearing are paired and the result is a field element (Figure [IJb),(c)). This 
field element is the "answer" to the corresponding #CSP instance the circuit 
represents, computed over the field (more generally, we could use a ring or 
even semiring). In the statistical setting, it is the partition function. If there 
are unmatched wires or "dangling edges," the circuit is open and its value is a 
tensor (Figure [Ija)). In the statistical setting, this is a marginal distribution 
over a subset of variables, or the most likely assignment if working over the 
tropical semiring Predicates in (below, predicates of the form sPf 5 

or sPf G), are called simple. A compound gate or cogate is a partial circuit 
composed of predicates in which represents a partial pairing but still has 
dangling edges, and for which the ality is that of a (co)gate respectively after 
the partial contraction is performed (Figure [5]). A general compound predicate 
can have mixed ality. 
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Figure 1: Viewing XOi?2,i as (a) an open circuit, logic gate, or function; on the 
left it "accepts inputs" and on the right "outputs" their sum modulo two. After 
pairing all vector spaces, we have (b) a closed circuit whose value is because 
the indicated assignment is not in the graph of the function. In (c) we show the 
#CSP interpretation; the value of this closed circuit is 2 because two of the four 
possible assignments to the open circles (representing summation), (0|(1| and 
(1|(0|, yield elements of the function's graph (contract to 1). That is, val(r) = 
((l|)((0| + (l|)((0| + (l|)-(XOi?2,i) = ((100+(101| + (110| + (lll|)-(XOi?2,i) = 2. 




Figm-c 2: In (a), a swap gate built of X0i?2,i (or EVO3) and FAN0UTi^2 (also 
called =3). Multiple orientations of the edges produce valid functional interpre- 
tations on the circuit. Reversible gates are a special case of the freedom to orient 
in any direction that makes each predicate act as a function. In (b), arrows are 
removed and edges curved to the top to suggest a #CSP interpretation. 
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Observation 3.2. In reasoning about the value of a closed circuit, we may 
treat compound predicates as black boxes: they will behave exactly as though 
there were an actual predicate in terms of their contribution to the circuit value. 

For example, to represent a #3SAT problem, we could use an arity-d all- 
equal (AE'^) cogate to represent each variable appearing in d clauses, OR3 
gates to represent clauses, with an edge if a variable is included in a clause. The 
value of such a circuit is the number of satisfying assignments. If the tensor 
coefficients are not in the set {0, 1}, it is called a weighted sum or weighted 
#CSP problem and its value is a weighted sum or polynomial. Note that even 
in the weighted setting, an open circuit may still have {0, 1} coefficients after 
the partial contraction is performed; indeed it can be worthwhile to engineer 
things this way to implement Boolean functions with restricted sets of (possibly 
non-Boolean) predicates. 

Open circuits are useful conceptually, and "evaluate" to tensors, i.e. mul- 
tilinear transformations. However, we primarily evaluate only closed Pfafhan 
circuits F, as these are the type which have a general polynomial time algo- 
rithm (Algorithm [IJ to obtain val(F). An open circuit is a bit like a quantum 
computation before, and a closed circuit a quantum computation after, inputs 
and measurements have been made. 

If the dangling edges of an open circuit can be partitioned into a set of k 
inputs and £ outputs, such that every possible input in {0, 1}'' has a unique 
nonzero coefhcient for some assignment of values in {0,1}^ to the outputs, we 
can view the open circuit as representing a function; the open circuit's value 
is a tensor which is the formal sum of points in the graph of the function, 
^ /(k)) ■ ^ € {0: l}*^}- If the output assignment is not unique, one 
might interpret this as a nondeterministic function. This is useful in using an 
orientation to compute a circuit value as in Section 13.11 We can also consider 
orientations of the edges that mark inputs and outputs appropriately. If we 
then make a partial assignment to the input and output wires, and sum over 
the rest, we can compute various things about the function so represented. For 
example, suppose the circuit is a multiplication circuit, so the input is the bit 
representation of two integers and the output their product. Fix the output to 
be a given integer. Now trying a zero and one assignment to each bit of input 
and summing over the rest, and checking to see if the resulting closed circuit 
value is nonzero, we can find a factors of the given integer by binary search. 

The notion of circuit value is inherently of the counting type, but can be 
specialized to answer decision, function, and ^function questions. Input, out- 
put, and ^function vs #decision problems are just a matter of perspective on 
time flow through the circuit. The fact that tensor contraction networks can be 
interpreted in more than one way depending on the orientation or flow of time 
chosen is familiar to physicists in the guise of the time-invariance of Feynman 
diagrams. 
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3.1 Evaluating tensor circuits with factorization 



The sum-product algorithm ^ [33] , which computes certain exponentially large 
sums of products in polynomial time, is one of the most ubiquitous algorithms in 
applications. It is a strategy for computing the value of a tensor circuit which 
has been rediscovered many times under different names. The sum-product 
algorithm is used in graphical models (belief propagation and junction tree), 
computer vision, hidden Markov models (forward-backward algorithm), genetics 
(Needleman-Wunsch), natural language processing, sensor networks [33], turbo 
coding [33] , quantum chemistry, simulating quantum computing [37] , SAT solv- 
ing (survey propagation) and elsewhere. The sum-product algorithm applies to 
problems phrased in terms of requirements, clauses, or factors (here, "gates") on 
the one hand and variables ("cogates") on the other, and for which the bipartite 
factor graph with an edge (w, c) if variable v appears in clause c is a tree. This 
factor graph is precisely a tensor contraction network or circuit as described 
previously. It works over any semi-ring and exploits the tree structure to factor 
the problem, thus reducing a seemingly exponential problem to one which can 
be computed in polynomial time. However, despite approximate generalizations 
|40] . the algorithm is fundamentally limited to the case where the underlying 
factor graph is a tree. 




Figure 3: Kschischang et al.'s factor graph [331 Fig. 1]. Each variable Xi 
corresponds to the Boolean cogate tensor AE'^ where d G {1,2} is the degree 
and each factor A-E to a gate tensor with arbitrary coefficients. Choosing a root 
vertex yields a factorization of the tensor contraction for efficiently computing 
the value of the open circuit created by omitting that vertex. 

Let r = (y, C, E) be a tree. Then the sum-product algorithm is the factor- 
ization of the total and partial contractions achieved by rooting at a vertex and 
factoring accordingly. Explicitly, we write the tree in Cayley format in terms of 
nested angle brackets, keeping the variables (cogates) on the left and tensoring 
over the leaves at each level. For Kschischang et al.'s example factor graph in 
Figure [H the total contraction factored by rooting at xi is the scalar 

val(r) = {xi,A(S{x2,B(d {x3,C<» {xa,D) ® {x^.E)))). 

Partial contraction of all but xi yields a marginal function of Xi as the following 
element oiV^ ®V'^: 

val(r') ^ A® {x2,B® {x3,C <^ {xi,D) (g) (xg, £'))). 
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This is a tensor-valued partial contraction, the value of the open circuit F' 
formed by removing the node labeled xi from F, so that edges 1 and 9 are 
dangling. 

The sum-product algorithm, particularly in its junction-tree [3S] guise, is 
useful in computing the value of circuits with small treewidth. It has been used 
to simulate quantum computations ^7\. Note that one strength of the sum- 
product algorithm over holographic algorithms is that it requires the tensor 
coefficients only to lie in a semiring, whereas holographic algorithms require 
ring coefficients as they exploit cancellation. We will limit ourselves further to 
coefficients in a field, typically C. 

4 PfafRan circuits and their evaluation 

In [?71 HSl ES] L. Valiant introduced a different approach to the same sort of 
problem (an exponentially large sum of products diagrammed by a factor graph). 
Using matchgates and holographic algorithms, he proved the existence of poly- 
nomial time algorithms for counting and sum-of-products problems that naively 
appear to have exponential complexity. Such algorithms have been studied in 
depth and further developed by J. Cai et al. [171 IHl HI]- In particular they 
are able to provide exact, polynomial time solutions to problems with non-tree 
graphs but are subject to other, somewhat opaque algebraic restrictions. Holo- 
graphic algorithms apply if the coordinates of the predicates involved, expressed 
as tensors, satisfy a collection of polynomial equations called matchgate identi- 
ties, generally after a change of basis on the underlying vector spaces (one space 
per edge) . Currently the same change of basis has always been applied to each 
edge, so simply allowing non-global basis change may provide additional power. 

Each edge of a closed tensor circuit represents a pairing of a vector space and 
its dual. Performing a change of basis B along an edge of a tensor circuit means 
applying the transformation B to the primal and B^ to the dual, which does 
not affect the value of the circuit (Proposition 12 .3|) . We may view the result as a 
new circuit in the standard basis, with the same graph but different predicates, 
and the same value as the original circuit. This provides an alternative method 
to find holographic reductions [49] from one problem to another: rather than 
reason about gadgets and changes to the graph, one can just write down the 
desired circuit, perform a change of basis on each edge, and obtain a new circuit 
which has exactly the same value but a different set of predicates. 

One reason to perform changes of basis is that the new set may often have 
a sparsity pattern, or satisfy other conditions that make evaluation easier. For 
example, it might now be possible to separate an arity-four predicates into 
two arity-two predicates, allowing a better factorization. The improvement of 
interest to us, however, is that the new predicates can sometimes be made to 
satisfy the matchgate identities. When this is possible, and the graph is planar, 
circuit evaluation can be accomplished very efficiently. 

A Pfaffian circuit, open or closed, is a tensor contraction circuit: a collection 
of Pfaffian gates and cogates arranged in a bipartite graph, e.g. to represent 
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a Boolean satisfiability problem or a circuit computing a function. There are 
restrictions on both the predicates allowed (which must be PfafSan) and the 
ways in which they may be composed (planarity and matching bases). Because 
of the relationship of Pfafhan circuits to PfafRans, their closed circuit values 
(full tensor contractions) can be computed efficiently. 

4.1 PfafRans and subPfafRans 

The PfafRan of an n x n skew-symmetric matrix S is zero if n is odd, one if 
n = 0, and for n > Q even is the quantity 

a 

where the sum is over permutations where cri < cr2, crs < (T3, . . . , Un-i < (Jn, 
and ai < (73 < (75 < ■ ■ ■ Un-i- Careful consideration of the special structure 
of a Pfaffian corresponding to a planar graph can yield an 0(n"p) algorithm, 
where the order of planar Pfaffian evaluation ujp depends on the the ring and 
is generally between 1.19 and 3; see the discussion after Algorithm [T] If S is 
empty, 2 x 2, 4 x 4, or 6 x 6, the Pfaffian is 1, ^12, - 6364 + and 

£.12 Pf S3456 - f 13 Pf ^2456 + Ci4 Pf 52356 " '^is Pf ^2346 + Pf S2345 respectively. 

Definition 4.1. For nxn skew-symmetric matrices S and 0, define the tensors 

sPf(S) = ^ Pf(S,)|/) (2) 
/cH 

and 

sPf^(e)= ^ Pf(e,;c)(j| (3) 

./C[n] 

where |/) denotes the bitstring tensor which is the indicator function for / and 
S/ is the submatrix of S including only the rows and columns in /. These 
easy-to-represent 2" dimensional tensors are the building blocks of holographic 
computations. We can also define versions of ([2]) and ^ in which the tensor 
bits, and corresponding rows and columns of the matrix, are explicitly labelled 
as in dD). 

Definition 4.2. A gate G (resp. cogate J) with n edges is Pfaffian over a field 
F if there exists an n x n skew-symmetric matrix S (resp. O) over F and a field 
element a (resp. (3) such that G = asPf S (resp. J = /3sPf^ 9). 

Note that for nonsymmetric predicates, permuting the edges does not affect 
whether or not the predicate is PfafRan. Which edges correspond to which bits 
of the gate is part of the data of the circuit. 

Now we consider how to form the direct sum of labelled matrices and the 
tensor product of labelled tensors. Consider a set M of matrices, each of which 
has its rows and columns labeled by a different subset of the numbers [n] = 
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{!,...,«}, such that the label sets partition [n]. Suppose G M with label 
sets I,JC [n], / n J = 0. Define 5 ® 5' to be the direct sum of 2 and S', i.e. 
the matrix S" with label set / U J which has ^^'^ = if one of k, £ is in / and 
one in J, and otherwise the entry is whatever it would have been in S or S', 
e-g- e^'. = e^. if{fc,nc/. 

4 5 7 9 
65 67 \ 
64 69 
^44 C49 
C55 ^57 
C75 ^77 

^94 ^99/ 

Similarly if T = 01x2x^x1) and T' — j3\x3XiXg) are two tensors with labeled bits, 
define T (E)T' io be al3\x2X^XiXc,x-jXQ) and extend linearly to general tensors. 

Now suppose r = (Gates, Cogates, Edges) is a combinatorial connected pla- 
nar bipartite graph with n edges. A graph is bipartite iff it does not contain an 
odd cycle. Thus all of the regions defined by a planar embedding of a planar 
bipartite graph have an even number of edges. Therefore every vertex in the 
dual of r has even degree, and there exists a non-self-intersecting Eulerian cycle 
in the dual. This cycle corresponds to a oriented closed planar curve through F 
crossing each edge of T exactly once; oriented, it defines an ordering or labelling 
of the edges by {!,...,«} where the rth edge crossed gets the label r. The 
corresponding closed curve separates the cogates from the gates, and induces 
a clockwise or counterclockwise cyclic (but not necessarily consecutive) edge 
order on the edges incident on any vertex. Both embedding and curve can be 
computed in 0{n) time. A proof that such a curve produces a "valid" edge 
order appears in |34| . We now give a new proof of this result, specializing to a 
particular curve in order to simplify the argument. 

Given F and a planar embedding, define a new graph F' whose vertices are 
the gates only. If m gates are incident to an interior face of F, include the edges 
connecting them in a m-cycle around the region. See Figure2]for an illustration. 
The resulting graph F' is planar; take a spanning tree. Choose a single exterior 
gate and connect it with a line to a circle drawn in the plane to contain F'. Now 
thicken the edges, circle, connecting line, and vertices representing the gates, 
and orient the boundary of the resulting region. This defines a closed curve 
crossing each edge of F exactly once. Call the result a spanning tree order on 
the edges of F. 

We claim that such a curve and order could also have arisen if we had begun 
with the cogates instead and performed the same construction. By construction, 
the gate side of the curve is connected, and so is its complement in the plane. 
In other words, any cycle among the cogates is contractable to a path in the 
spanning tree. Were it not, then we would have a gate which is isolated from 
the rest. The exterior cycle among the cogates is prevented by the connection 
to the exterior circle. 
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Figure 4; Planar spanning tree curve construction: (a) the circuit, (b) the region 
graph on its gates, and (c) an oriented spanning tree curve. 



The curve construction is depicted in Figure HI Another feature of this 
particular curve is that if a gate node is replaced by a compound gate, we 
can just invaginate the curve between two edges of the gate and extend the 
construction recursively (Figure [5]). 

Lemma 4.3. Suppose the edge labelling comes from a spanning tree order. Let 
Gi = sPfS^, G2 = sPfS^ be edge-labelled Pfaffian gates connected in this 
spanning tree (and so by a cogate) in a closed circuit T. Gi and G2 have 
disjoint label sets /i,/2 C [n] respectively. Then Gi ® G2 ^ sPf(S"'^ "E?) as an 
identity of edge-labelled tensors. 

Proof. We have 

Gi(8)G2 = sPf(Si)0sPf(s2) 



(j2 pf(s^ji^i>) f E pf(Sjji^2) 

\JlC/l / \J2C/2 

^ Pi{E]jPi{E%)\J,UJ2) 
■J1CI1.J2CI2 

E Pi{EKnijPmlniJ\K) 
KCI1UI2 
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Figure 5: Recursive view of the planar spanning tree. The dashed hue is a 
compound gate. 

whereas 

Kdiuh 

so it is sufficient to show that for all K C Ii U I2 the coefficients are equal, 

pf(sUjPf(sUj = Pf((s^©s^)/^)- (4) 

This is a slight generalization of the fact that for skew-symmetric matrices 
M, M' with rows and columns labelled consecutively where 1, . . . ,k label M 
and k + !,...,£ label M', then 

sPf (M) (g) sPf (M') = sPf (M ® Af ') ^ sPf (^^^ , (5) 

so we reduce to this case. If K is completely contained in Ii, both sides of 
Equation ^ equal Pf(S^) and similarly if X C /2. Suppose K has nontrivial 
intersection with both label sets, li Ii O K is even and I2 K is odd or vice 
versa, both sides of Equation Q are zero, li Ii D K and I2 D K are both odd, 
the left hand side is zero but we must show the right hand side is also zero. But 
The matrix 'E} may be written as a block-diagonal matrix by applying the 
same permutation to the rows and columns. Thus the left hand side and right 
hand side of ([4]) are equal up to the sign of this permutation, and the right hand 
side is also zero. 

Now suppose IiOK and 12^1 K are both even. We would like to reduce to the 
block diagonal case ([5]), i.e. the matrix with labelling i\, 13, ... , *i7 ■ • ■ j ^^/^ 
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where «i < ^2 < ' ' ' < ^^'^ *i < *! < ' ' ' ^ *f/2|- claim that the permu- 
tation connecting the interleaved curve order ji , . . . jn of ® to the block 
diagonal order is even. This is where we will need the curve order. 

Draw a small curve separating the two gates and the cogate connecting 
them, with the small curve crossing each edge to the outside world (Figure H] 
(e)). The big planar spanning tree curve enters and leaves this circle in the 
same "wedge" between two edges, i.e. across a face of F'. Thus we may assume 
the order induced on the edges of the gates we are considering is the lower part 
of the Gi indices in (as consecutive as possible) ascending order, followed by all 
the G2 indices in ascending (as consecutive as possible) order, followed by the 
remainder of the Gi indices in order. Since G2 has an even number of edges, 
the permutation required to move the top half of the Gi indices past them to 
join the lower half (so the indices are in block diagonal order) is even. □ 

When joining compound gates, the argument is unchanged because of the 
recursive nature of the spanning tree order (Figure 11(d)). Thus the Lemma 
shows that we may join two compound gates and maintain the correct Pfaffian 
value, and inductively join all the gates. The critical part of the following result 
is that the same edge order works for both gates and cogates. 

Theorem 4.4. Let T be a connected bipartite embedded planar graph with the 
edge order induced by a planar spanning tree curve, and let {Gi} be the collection 
of edge labelled Pfaffian gates, Gi = sPf S^. Then ®iGi = ®iSi. The analogous 
result holds for cogates with the same curve order. 

The total contraction of the dual gate and cogate sub-Pfaffian tensors can 
be computed efficiently. The following result follows directly from one which 
appears in [13 p. 110]. 

Theorem 4.5 (Pfaffian kernel theorem). Let and S be nx n skew- symmetric 
matrices with n even. Let Q be Q with signs flipped in the checkerboard pattern 
which flips the diagonal. Then the pairing 

(sPf" e, sPf S) = Pf(e + S). (6) 

On the left hand side is an exponential sum, which can be over all "inputs" , 
and on the right a quantity which can be computed with the complexity of Pfaf- 
fian evaluation. This kernel allows us to compute the value of a closed Pfaffian 
circuit efficiently. For example under certain conditions, we may evaluate the 
sum over all inputs (for ^decision and ^function problems) in a way analogous 
to a quantum unitary operator acting on a superposed state. Note that Valiant's 
Pfaffian Sum Theorem [35] is Theorem 14.51 with 9i^j — XiXj. 

A circuit was defined in Definition 13.11 for general predicates. We now add 
some restrictions to facilitate fast evaluation by Theorems 14.41 and 14.51 Every 
edge e in F corresponds to to a vector space . A typed predicate is a predicate 
together with a type labeling each edge, which indicates which basis change we 
make on that edge's vector space. 
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Definition 4.6. A Pfaffian circuit is a combinatorial object with 

(i) a set 5^ of typed gates and ^ of typed cogates witli coefficients in a field 
F, the gadgets we may use to build the circuit, and 

(ii) a connected bipartite graph F — (Gates, Cogates, Edges) where each gate 
in Gates is drawn from 5^ and each cogate from ^ , and where the gate 
and cogate incident on each edge agree on the edge type. This represents 
the circuit architecture: which vector spaces are paired, including the 
assignment for non-symmetric predicates, 

such that every typed gate and typed cogate is Pfaffian after the indicated 
change of basis, and F is planar. 

The value of a closed Pfaffian circuit can be computed efficiently. 

Theorem 4.7. The complexity of computing the value of a closed Pfaffian cir- 
cuit over a field ¥ with graph F is the complexity 0{n^^) of computing a planar 
Pfaffian with graph F over F. 

Proof. Apply Theorems l4.4l and HT51 One must accumulate in a quantity 7(Gates, 
Cogates) the adjustment factors a and (3 of Definition 14. 2[ which thus depend 
only on the number and type of gates and cogates appearing in the closed Pfaf- 
fian circuit. An explicit algorithm to obtain the value of a circuit is given in 
Algorithm [1] its complexity is dominated by evaluating the Pfaffian. 

□ 

Algorithm 1. 

Input: A closed Pfaffian circuit F with an even number n of edges. 
Output: The value val(F) of the circuit. 

1. If not given, find a planar embedding of F {0{n) time [lOl [27l [9]), and 
define a planar spanning tree edge order (0(n)). 

2. Form VL, a skew-symmetric n x n matrix of zeros as follows. Set 7 = 1. 

(a) For each Pfaffian gate G with changes of basis transformation Tq and 
Tg{G) = aa sPf (Sg) (after basis change) with edges I = {ii, . . . ,id} 
incident, let 7 ac^ and set the submatrix 57/ = Sg. 

(b) For each Pfaffian cogate J with changes of basis transformation Tj 
and Tj{J) — /3jsPf^(0j) with edges / = {ii,...,id\ incident, let 
7 ^ /3,77 and set Vti = Qj with 6*^ = {-iy+^+^0.,j . 

3. Output 7 Pf (17). 0(n"p), the order of Pfaffian evaluation. 

We have used LUp for the order of Pfaffian evaluation because it can depend 
on the ring; we now collect a few results on what is known about ujp. The 
evaluation of the Pfaffian can be accomplished without division over an integral 
domain in O(n^) |24| . One can also exploit the planar sparsity structure of the 
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graph to get a faster evaluation. Avoiding writing down explicitly, and using 
a nested dissection algorithm [36] for Pf(r2), the evaluation can be obtained up 
to sign in 0(n"/^), where w is the exponent of matrix multiplication. Over 
finite fields this smaller exponent can be achieved with randomization |53| . A 
discussion of numerical issues can be found in Section 12 of [15] . 

After all changes of basis in a closed circuit, we are left with a circuit in 
which all edges are in the standard basis. The change of basis device is just 
a means of making non-obvious transformations of predicates with which it is 
natural to express the problem of interest (e.g. Boolean functions) into pred- 
icates which are generally non-Boolean, but whose composition represents the 
same counting problem. Thus in the next section we first describe Pfafhan pred- 
icates in standard basis, where all holographic computations end up before the 
contraction is performed. 

5 PfafRan predicates in the standard basis 

In this section we study which predicates can be implemented as the subPfaffian 
of some matrix, or small contractions of a few predicates which can, and relate 
these to spinor varieties. We first study the easiest case of PfafRan gates: gates 
of even- weight support that are subPfafhans, initially described in [34]. This 
corresponds to the restriction of the even spinor to an affine open. Then we study 
the affine odd case, and finally the homogenized, projective versions. The first 
two are easy to characterize in terms of the image of a natural parameterization 
by subPfaffians and in terms of defining equations, and represent set-theoretic 
affine complete intersections. The projective versions will require additional 
equations. The results in this section mirror those for matchgate signatures in 
the standard basis (e.g. [T5]). 

5.1 Affine spinor varieties and matchgates 

It is easy to characterize which tensors can be written as the subPfaffian tensor 
sPf S of some skew-symmetric matrix S. For a (^) tensor T and an n-bit string 
X, denote by T]^^^ — {x\{T) the coefficient of \x) in the expansion of T in the 
standard basis. Then the equations a degree n gate must satisfy to be the 
subPfaffian tensor of some matrix are 

1. Parity: Tyj.'^ — whenever there are an odd number of ones in the bit- 
string X (2" — 1 equations), and 

2. Consistency: if n > 4, there are consistency conditions imposed by 
larger Pfaffians being writable in terms of smaller Pfaffians. It is enough 
to consider all the Pfaffians in terms of the individual matrix entries. For 
example for a (°) gate G, this requires that 

G'loooo)^'!!!!!) = lG|iiii) — G|iioo)G|ooii) — G|ioio)G'|oioi) +^10110)^11001) 
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where Gioooo) — 1 since the PfafRan of the empty matrix is one. We wih see 
shortly that (1) may be exchanged for the complementary requirement that 
even weight coefficients must be zero by considering a partial contraction of an 
(n + l)-gate and a 1-cogate, and (2) may hkewise be homogenized by partial 
contraction. 

Since in the standard basis one of these parity restrictions will always hold, 
it is convenient to consider the variety as a subvariety of the vector space W+ 
of even or W- of odd weight induced basis elements. 

Together these equations are sometimes called Grassmann-Pliicker relations, 
matchgate identities [3H]j generators of the ideal of a spinor variety, or defining 
equations of the (even) orthogonal Grassmannian [32] ■ They also arise naturally 
in connection with Clifford algebras. The space of PfafSan (affine with even 
support) gates is the image of the polynomial map sPf(E!) as S varies over all 
skew-symmetric matrices in F and so has the structure of an algebraic variety. 
The requirements are the same for the dual with the obvious modifications 
(support on bitstrings with an even number of zeros, etc.). Taking the Zariski 
closure in ~ ^ , or or PVF_ as appropriate and homogenizing will lead to 
the projective version. 

We now connect these requirements to the original matchgate identities |3H] • 
Valiant's [48j two-input, two-output gates act independently on the parity sub- 
spaces and the determinant of both must be the same. To see this, consider a 
(!^) gate G interpreted as two-input, two-output gate G. 
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Writing out the linear transformation it represents (as one might for a unitary 
operator in quantum computing), we get a matrix in the "operator" ordering, 
viewing bit 4 as the output corresponding to input bit 1 and bit 3 as the output 
corresponding to input bit 2, which is 







0102) 


IO1I2) 


I1102) 


I1112) 


I0403) 




foil 


bi2 


biz 


&14 \ 


I0413) 




b1\ 




b23 


^24 


I1403) 




hi 


bs.2 


bss 


634 


I1413) 


v 


641 


bi2 


^43 


644 / 



In the cyclic ordering coming from the planar curve of Theorem l4.4[ the matrix 
is 
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Valiant's matchgate equations are 
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1. Parity: &12 = &13 = 621 = ^24 = ^31 = ^34 = &42 = ^43 = 0, i.e. the coefficient 
of any four-bit |a;) wliere x has an odd number of ones must be zero, and 

2. Consistency: 611644 - 622633 = 614641 - 623632. 

In other words, the gate must have the form, in the operator ordering. 



IO1O2) 
IO4O3) / 611 
IO4I3) 



II4O3) 
II4I3) 



and the matrices 
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acting on the even parity subspace and 

Bodd = 



622 
632 



623 
^33 



acting on the odd must have the same determinant, det i^ei^en = det Bodd- When 
this holds and 611 = 1, so 644 = 614641 — 623632 + 622633, we can write G as a 
subPfafEan (suppressing the below-diagonal entries): 

/ 



G = sPf 
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So G = IO1O2O3O4) + 614IIIOO) + 623IIOIO) + 633IIOOI) + 622IOIIO) + 632IOIOI) + 
641IOOII) + 644IIIII) as desired. When 611 ^ 1, we may multiply sPf S by 
a constant a that will be accumulated in 7. When 611 =0, we will need to 
perform a contraction. 



5.2 Homogenization, odd support, and projective spinor 
varieties 

We now extend to (1) affine spinor varieties supported on the odd parity sub- 
space of and (2) the homogenized version where we obtain full projective 
spinor varieties. 

First, note; that the support of a predicate is the set of induced basis elements 
(e.g. jllOl)) with nonzero coefficients. 

Proposition 5.1. A predicate in the standard basis which is Pfaffian, or a 
partial contraction of Pfaffian predicates, has support either on the induced basis 
elements of odd or even weight. 
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Proof. First, note that the contraction of one simple predicate against another, 
where one is completely contracted, follows the usual parity rules: an odd and 
an odd or even and even make even, and mixed odd and even make odd. A 
single gate or cogate, not partially contracted, has pure parity. Then we proceed 
by induction on the number of predicates. □ 

This corresponds to the odd S- and even S+ spinor varieties. The odd 
and even Spiuj^ varieties |23i p. 390] each have dimension (2) , and lie in either 
the corresponding to the projectivization of the even weight vectors 

or the odd weight vectors. Note that in our dehomogenized version, where we 
consider the image of sPf S in the afhne open where the coefficient of |00 • • • 0) 
is one, the (2) dimensions correspond directly to the (2) free parameters of S. 
We now show how to explicitly parameterize these odd and even varieties, and 
their homogeneous versions (where the coefficient of |00 • • • 0) may be zero). The 
parity switch to the odd variety and the homogenization can be accomplished by 
partial contraction. Though obscured by change of basis, this division remains 
when we consider predicates with basis change. For example, no basis change 
can make |00) + into the 2|00) - |01) - |10) needed for X-matchings, but 
this is easily done reducing to |01) + |10) (see Example 16. 2p . Adding one edge 
and the arity-one dual predicate to a predicate flips its parity in the sense of 
Proposition 15.11 

In what follows we ignore a scaling factor that can be absorbed in the con- 
stant a or /3 by which a subPfafFian vector is multiplied and which is accumulated 
in 7. We first observe that parity is the only restriction up to arity three. 

Proposition 5.2. A gate of arity at most three can be realized in the standard 
basis by partial contraction of Pfaffian predicates if and only if it is supported 
on even or odd weight. 

Proof. We describe the situation for gates; the cogate version is completely 
analogous. The only if direction is Proposition 15.11 In the standard basis, the 
unique (up to scaling) arity-one simple gate is |0), the subPfaffian tensor of the 
1x1 zero matrix. This is the even weight piece; the odd weight gate |1) can be 
obtained as the partial contraction sPfS((l|) for the 2x2 matrix S with free 
parameter — 1: 

val ( [0] ) - |0) and val ( = | 1) . 

Similarly, the ONE^ junction is Pfaffian and ZERO^ is odd Pfaffian in the 
standard basis: 

val ( ® ) = (1 1 and val ( (=)-\^ = (0| ■ 

Linear combinations xjO) + y|l) where both x and y are nonzero are impossible 
(without change of basis). 

For arity two gates, Proposition l5 . 1 I tells us we should get support {|00), |11)} 
or {|01), |10)}; the first comes from pure gates and the second from gates with 
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arity three contracted against the arity-one junction. 

val(^^) = |00)+a2|ll) val(^ (D) = 63IOI) + ^lallO) 



The arity three case is similar. The (even) PfafEan case, the image of sPf 
apphed to a 3x3 matrix, gives points of the form |000) +^i2|110) +^i3|101) + 
^23|011) and scalar multiples where G|ooo> 7^ 0- For the odd Pfaffian case, we 
get arbitrary points, where S is 4 x 4, 



val ( ={e\ Q)) = 64IOOI) + 64IOIO) + CullOO) + A|lll) 



where A = Pf S, and the four coefficients can be chosen independently with no 
relations (so this fills the odd weight space and there is no need for a homoge- 
neous odd case). The homogeneous even case is, with a 5x5 matrix S and 2x2 
matrix O with free parameter 9 so sPf^ 9 = 6(00\ + (11 1, 

val(^f^) = (0 + ^5)1000) + (0f 1245 + ^12)|110) 

+ (0^1345 + 63)1101) + (06345 + 63)1011), 

Where 6ifc« := Pf Si^w Setting 62 = 63 = 63 = 65 = we obtain 

vai(r) = 0|ooo) + (6564-6465+)|iio) 

+(6564 - 6465)1101) + (6465+6564 - 6465)1011), 

an arbitrary point of varying 9 and the remaining free entries of S. □ 

Essentially this is because for gates of arity at most three, there are no 
nontrivial Pfaffians in the subPfaffian vector. Thus the first case where the 
variety of realizable gates is not completely characterized by the support is 
arity four, where also a consistency condition must hold. Here the even parity 
piece is a (2) = 6-dimensional variety in the open affine C"^ C = ¥{{\ijk)}) 
where Gioooo) = 6 — 1j so a hypersurface, cut out by the 4x4 Pfaffian quadric 

^10000)^11111) — G|iioo)G|ooii) + G|ioio)G|oioi) — G|oiio)G|iooi). 

The odd case is similar. 

For n = 5, we again have a set-theoretic affine complete intersection with a 
10-dimensional variety in C^^ cut out by (^) = 5 Pfaffian consistency conditions 
of the form 6jfc£ = Pi'^ijki- 

The homogenized version is not a complete intersection, in fact this is an 
important example in connection with the conjecture of Hartshorne that for n > 
|iV every nonsingular n-dimcnsional variety in is a complete intersection. 
The six-dimensional Grassmann variety in and this 10-dimensional spinor in 
P^^ are the only varieties X corresponding to orbits of linear algebraic groups 
for which n — '^N where X is not a complete intersection [53] p. 64]. In this 
case, the additional five relations needed in the homogeneous case are of the 
form [39] 

626345 — 636245 + 646235 ~ 656234 
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and are easily shown to lie in the ideal of the Pfaffian relations when we restrict 
to the case G|ooooo> = ?0 ^ 0. 

For n = 6, we have a (2) = 15-dimensional subvariety of P'^^ defined by 
(4) (e) ~ equations, and so on. 

Proposition 5.3. The set of even gates in the standard basis is a set-theoretic 
affine complete intersection in the open ajjine G|oo...o) 7^ 0; cut out by the non- 
trivial Pfaffians. 

Proof. The dimension is (2) and the dimension of the ambient space is 2"^^ — 1, 
while the number of equations is 




The first can be rearranged to 




In the matchgrid/matchcircuit approach, implementing a gate of arity r may 
require 0{r'^) edges [50} [20]: 

Theorem 5.4 (5.1 in [SO], based on 4.1 in f5(T ). For any matchgrid with r 
external nodes there is another matchgrid with the same standard signature that 
has O(r^) edges of which O(r^) have weight different from 1. 

This means that a matchgate holographic algorithm has complexity 0{rf-,^^^n'^''), 
where r^ax is the arity of the matchgate with the most incident edges. In con- 
trast, in our approach r^^^^ is replaced by the constant two. 

Theorem 5.5. Any predicate of arity n (n external nodes) which is Pfaffian, 
or the image of a Pfaffian predicate after basis change, can be implemented with 
at most two additional edges connecting to one additional predicate that has no 
external edges. 
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Proof. The parity-switch and homogenization constructions in the proof of Propo- 
sition [5]2] work as well for gates of arbitrary arity. Then to implement any gate 
in the standard basis, we need to add at most one new predicate and at most 
two new edges. □ 

It would be interesting to see if this could be combined with lower bounds 
to strengthen Valiant's result (Theorem 5.2 in 50") that there is no elementary 
matchgrid algorithm for 3 CNF by weakening the hypotheses in the condition 
'elementary.' 



5.3 Some problems in the standard basis 

Let us consider some sample problems that can be solved without resorting to 
a change of basis. One illustrative class are certain lattice path enumeration 
problems. A recent survey is |28) . With zero-one weights, these are purely 
combinatorial counting problems. If we place weights on the edges, we can 
use the fast partition function to do parameter inference in the corresponding 
statistical model over paths or loops efficiently. 

Example 5.6 (Lattice paths on a region of a square grid). Consider the problem 
of counting paths in a square grid (Figure [7]). The number of monotone (N,E) 
lattice paths from (0, 0) to (m, n) in a to x n grid is (""Jj^") . Suppose more 
generally we want to count the number of paths using steps in any of the four 
directions, subject to some restrictions. For example we might want the path to 
start somewhere and finish somewhere else, to pass through certain segments, 
or not to intersect itself. By representing these conditions in the clauses, we 
can sometimes compute these quantities with a Pfaffian holographic algorithm 
in the standard basis. 

Suppose we want a satisfying assignment to correspond to a monotone path. 
Then we need to (1) set a start and end point, (2) require that at each vertex 
other than the start and end, the path both enters and leaves and (3) exclude 
South and West moves. We can accomplish (1) by adding a (1| junction attached 
to the desired start and end vertices as shown in Figure[7{b). Requirement (2) is 
enforced by only allowing, at each gate corresponding to a grid vertex, either the 
activation of no edges or an even number of edges. This happens automatically 
if we use the even parameterization and the standard basis. Since no grid vertex 
may have an odd number of active edges incident, the path must grow from the 
start and end vertices. 

Since in general Pf S ^ 0, and this is the coefficient of a crossing -I- of two 
paths, paths with loops and even isolated loops are generally included (loops are 
counted only once, with no winding) . South and east moves are also included. 

Observation 5.7. Given a start vertex Vs and end Ve vertex weakly northeast 
of it, monotone paths connecting them are exactly those words w on alphabet 
N, S*, E, W containing no ES or NW subwords, and such that wVs — Ve- 

The six possible paths through a vertex correspond directly (Figure [6]) to 
the six entries of each S, so to exclude a certain path we need only zero out its 
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entry. With the labelhng of Figure [HI setting ^34 = disallows the subwords 
£^5* and NW in the path, which with the start and end conditions restricts to 
monotone paths (and excludes isolated loops). 

For many special cases of lattice path walks, explicit results are available (e.g. 
in terms of a generating function). For example, the number of monotone paths 
from (0, 0) to (n, n) in an n x n grid that stay weakly above the line y = x is the 
nth Catalan number C„ = (n + l)^^ (^^) . The nth Schroder number counts such 
paths but with allowed step N, E, or NE to (n, n), the central Delannoy number 
counts paths with these steps but without the above-diagonal restriction, and 
so on. 

In general suppose we are given any connected lattice region consisting of 
n boxes, together with a start and a finish vertex. Then we can write down a 
skew-symmetric matrix with at most (8n -I- 2) rows (with this bound achieved 
in the case of diagonal boxes), the square root of the determinant of which is 
the number of monotone paths through the region connecting the start to the 
finish. This yields an O(n'^J') algorithm for performing this count. The resulting 
matrix can also be examined to obtain expressions that simplify the Pfafhan. 
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Figure 6: The (2) — 6 paths through a vertex correspond to the six free pa- 
rameters of a 4 X 4 skew-symmetric matrix S, with the crossing of two paths 
through the vertex corresponding to Pf S. 
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Figure 7: A monotone path (a) and the two squares in its lower left (b) showing 
the start point and underlying factor graph. A satisfying path is shown, with 
edges carrying a one doubled. Junctions are AE"^ and gates sPf E for whichever 
S we are considering. 

We can evaluate weighted versions, force certain path segments to be in- 
cluded, introduce holes in the region, and so on at no computational cost by 
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varying our six-parameter choice of S at each grid vertex. Including m start 
and end vertices counts famihes of nonintersecting monotone paths. 

Setting all the free entries of the S corresponding to each internal vertex 
to one allows moves in all four directions, loops, and self-intersection. Since 
Pf S = 2 in this case, crossings contribute a multiplicative factor of 2. In other 
words a path contributes 2^*^°'^ intersections ^^le sum over all paths. 

We can check our computation for the 1x2 grid in Figure [71[b); fix the edge 
order shown in Figure [7l^b). For both monotone and general paths, O has a one 
in the places {(2, 3), (4, 5), (6, 7), (9, 10), (11, 14), (12, 13), (15, 16)}. To enforce 
the starting and ending conditions, we disallow (2, 16) and (7, 9) (otherwise we'll 
get the perimeter loop included) in S. For general paths from start to finish, S 
has an ^i<j = 1, i, j e / submatrix for subsets / € {(1, 2), (1, 16), (3, 4), (5, 6, 12), 
(7,8,9),(10, 11),(13, 14,15)}. There are no sign flips for 6. Then Pf(e + 
S) = 4 as expected. For monotone paths, Q is the same but S is adjusted 
to exclude moves (13,14) and (5,12) , / e {(1, 2), (1, 16), (3, 4), (5, 6), (6, 12), 
(7,8,9),(10, 11),(13, 15),(14, 15)}. Then Pf (6 -h S) = 3 as expected. To count 
closed loops with no start or finish fixed, and each edge traversed at most once, 
we use / e {(1, 2, 16), (3, 4), (5, 6, 12), (7, 8, 9), (10, 11), (13, 14, 15)} for S and get 
Pf(9 + S) = 4: empty, left or right square, and perimeter. Some Python code 
that performs the computation is as follows (here we take advantage of the fact 
no sign flips arc needed). 

import numpy as np 
from math import sqrt 

onesabove = lambda n:np. triu(np. ones ( (n.n) ) ) - np. tril (np. ones ( (n.n) ) ) 
def count (xiones .thetaones .edges) 
A=np. zeros ( (edges , edges , int) ) 
for I in thetaones+xiones : 

J=[i-1 for i in I] #adjust to index from zero 
A [np . ix_(J, J)] = onesabove (len( J) ) 
return sqrt (np . linalg. det (A) ) 

thetaones=[(2,3) , (4,5), (6,7), (9,10), (11,14), (12,13), (15,16)] 
#first the loops and crossings allowed version 

xiones=[(l,2) , (1,16) , (3,4) , (5,6, 12) , (7,8,9) , (10,11), (13,14,15)] 
#monotone version 

xiones=[(l,2) , (1 , 16) , (3 ,4) , (5 , 6) , (6 , 12) , (7 ,8, 9) , (10, 11) , (13, 15) , (14, 15)] 
#no start or finish version 

xiones=[(l,2,16) , (3,4) , (5 ,6 , 12) , (7,8, 9) , (10,11), (13,14,15)] 

Krattenthaler [32] , reduces a certain count of lattice paths to evaluation of 
a Pfaffian in a particular case. Such problems are related to Hilbert series of 
ladder determinental rings. Our example shows how such algorithms (in fact 
formulae) for various lattice path counting problems can be derived in general 
using holographic algorithms. 

Example 5.8. Bonin et al. [7] give a polynomial time algorithm for computing 
the Tutte polynomial of a lattice path matroid, refined and generalized to a 
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O(n^) algorithm for multi-path matroids in [S]. Computing the Tutte polyno- 
mial of many classes of matroid, including arbitrary transversal or bicircular 
matroids, is #P-complete [55]. We now show how a Pfaffian circuit can be ap- 
plied to evaluate the Tutte polynomial of a lattice path matroid in 0(n"p) time, 
a substantial improvement. 

Following [7], a lattice path matroid is defined as follows. Let P and Q 
be two lattice paths (0,0) to (m, r) which do not cross. Let {pu^ , ■ ■ ■ Pu^} , 
ui < U2 - ■ -Ur the set of North steps of P, and {pi^, . . -Pe^}, (-i < ■ ■ ■ ir the 
set of North steps of Q. Let M[P,Q] be the transversal matroid with ground 
set [m -I- r] = {1, 2, . . . , m -I- r} and presentation (iVj : i G [r]) where Ni are 
the sets of consecutive integers {£i, . . .Ui). Lattice path matroids have many 
nice properties, such as being closed under matroid duality and direct sums. 
There are ^(C„ -I- (|-„"2i) connected lattice path matroids on n -I- 1 elements up 
to isomorphism. A lattice path matroid is any matroid isomorphic to such a 
matroid. Because of the following theorem, we can just consider a lattice path 
matroid to be given by the paths inside the region defined by P and Q. 

Theorem 5.9 ([7]). A subset B of [m + r] with \B\ ^ r is a basis of M[P, Q] 
if and only if the associated lattice path P{B) stays in the region bounded by P 
and Q. Thus the number of bases of M[P, Q] is the number of lattice paths from 
(0,0) to {m,r) that go neither below nor above Q. 

The Tutte polynomial t{M; x, y) of a matroid M in terms of the internal i{B) 
and external activities e{B) of a basis is 

tiM;x,y)= Yl ^"^''V^''^- 

BeSg(M) 

This is a useful formulation as we have the following. 

Theorem 5.10 ([7j). Let B be a basis of the lattice path matroid M[P, Q] and 
let P{B) be the associated path. Then i{B) is the number of times P{B) meets 
the upper path Q in a North step and e{B) is the number of times P{B) meets 
the lower path P in an East step. 

By selection of the S at each vertex, we have total freedom as to the weight 
of each possible two-edge path segment passing through any lattice point. Thus 
we can just set the relevant at the vertices along P and Q such that the value 
of the circuit is t{M;x,y). We conjecture that this approach can be extended 
to a fast algorithm for computing the Tutte polynomial of multipath matroids, 
where the path-basis relationship is more subtle. 

Example 5.11 (non-self-intersecting paths on a hexagonal grid). Using an even 
trivalent Pfafhan gate (such as ^12 = ^13 = C23 = l)j again paired with AE^ 
junctions, leads to a hexagonal lattice (Figure where at each grid vertex 
only no path, or a path which both enters and exits, is allowed. We also have 
not excluded separate closed loops. Thus we can count the (possibly weighted) 
number of non-self-intersecting paths (with extra loops) from a start vertex to 
a finish vertex in a connected region with n hexagons in 0{n^'') time. 
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Figure 8: Non-self- intersecting path in a hexagonal grid and lower left portion 
of underlying factor graph. Gates on the boundary are (possibly scaled) AE2 , 
since there is only one path through them. 

Because of the fast partition function available for such weighted path count- 
ing problems, estimating the S parameters of a statistical model over paths by 
maximum likelihood becomes feasible. 

6 Signatures PfafRan after basis change 

We postpone the majority of this investigation to future work but mention a 
few results here. 

6.1 Symmetric signatures under homogeneous basis change 

The space of symmetric signatures under homogeneous basis change has been 
well studied in algebraic geometry for well over a century. Let y be a two di- 
mensional vector space; then the space of symmetric signatures up to scale is 
P(Sym" V) = P", and the action of a homogeneous change of basis is the action 
of PGL2. The orbits have been computed in many cases. The symmetric signa- 
tures which are realizable are precisely those which have an orbit representative 
in S+US-. 

Symmetric signatures were analyzed in [15]. We give a more concise geo- 
metric proof of some of these results. 

Proposition 6.1. All arity three symmetric signatures are Pfaffian after ho- 
mogeneous basis change. 
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Proof. In the arity three case [13 Cor. 3.1], which corresponds to P(Sym'^ V), 
there are three PGL2 orbits. The PGL2 action is the projective motions of 
twisted cubic [v^], the points [v'^ ■ w] with v,w independent, and the points 
[u ■ V ■ w], with u, V, w pairwise independent. 

Thus to show that these are implementable under basis change relative to 
a set of restricted predicates, (here the predicates which are Pfaffian in the 
standard basis = 5+ U 5- ) , we just show that a representative of each orbit 
hes in This is clear since is Pfaffian, |0)°2 o |1) = |001) + |010) + |100) 
is odd Pfaffian, and |0) o (|0) + |1)) o (|0) - |1)) is Pfaffian. □ 

As n increases the number of orbits is no longer finite. In that case PGL2 
equivalence of points has a more complex description in terms of invariants (this 
is a subject of classical and geometric invariant theory). That is, one can find 
invariants of the PGL2 action such that two points lie in the same orbit iff 
the invariants take the same values (and that the ring of invariants is finitely 
generated was the motivation for the Hilbert basis theorem). This means that 
a description of the attainable signatures can be had by considering the variety 
traced out by the invariants as we vary the point in 

Example 6.2. The problem #Pl-Mon-Rtw-NAE-SAT was treated in [34]. We 
now give an example of a problem, X-matchings, from |49j that requires a 
homogeneous basis change, and couldn't be treated in [SI] because it requires 
the extensions to odd parity discussed above. 

Problem 6.3 (X-matchings [49]). The input is a planar edge-weighted bipartite 
graph {V, E, W) where the vertices V = Vi U V2 are again partitioned in two, 
and the vertices in Vi have degree 2. The output is the sum of the weight of 
all matchings, where the weight of a matching is the product of the weights of 
the included edges, and — {wi + ■ ■ ■ + Wk) for all the unsaturated vertices in V2, 
where wi, . . . ,Wk are the weights of the edges incident to that unsaturated node. 

The change of basis for X-matchings, which is the same on all edges, is 

-^G -I) -'^G -1) ^"^C -0 

Let junctions (generators) represent VI vertices in the original matching prob- 
lem with J = (00| + (01| -t- (10|. The gates (recognizers) replace V2 vertices, 
where a degree d vertex has gate ~ —d\0 . . . 0) -I- |0 . . . 01) + • • • + 1 10 . . . 0) . 
Here we need to use a parity switch. After basis change we obtain 

((0| + (1|)«^-((0| + (1|)(1|-(1|((0| + (1|) 
(001 -(111 

-2|00> + |0(0- 1)) + 1(0-1)0) 
-2|00) + 2|00) - |10) - |01) = |10) + |01) 
-d\0)'^'^ + dlO)®"^ - |0 ... 01) |10 . . . 0). 



A'»^{G2) - 
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Now 



sPr 



-1 

1 



where A'^'^iGd) is the subPfaffian of the (d + 1) x (d + 1) matrix S with ones 
in the {i^d + l)th entries, i < d + 1, and zeros for the rest of the free entries, 
contracted against the arity-one junction 



6.2 PfafRan implementable predicates under free basis change 

We now consider the question of which predicates become compound Pfaffian 
after arbitrary, heterogeneous basis change over C. Since this corresponds to 
the XGL2 action in the binary case, it can be studied as a special case of the 
kind of orbit classification which follows from the work of Kronecker, and has 
been studied in the general Lie algebra context by Vinberg [5T1 , Kac [3T] , 
Kostant-Rallis, and others. Parfenov [13] spelled out the classification of spaces 
of tensors with finite orbits under a product of GL actions in complete detail, 
with number of orbits, representatives, and the orbit abutment graph. This 
classification was rediscovered in the physics literature in the subsequent decade 
under the name "classification of entangled states up to SLOCC equivalence." 

We have already seen that val(r) fills either the odd or even weight subspace, 
i.e. equals P(S'+) or P(S'_), for arity one or two. The consequence is that the 
entire orbit of each parity subspace is accessible. For arity three, there are six 
nonzero GL2 x GL2 x GL2 orbits in (8) (8) . It is well known that they 
have the following orbit representatives. 





= |000) 




N2 


= |000) + 


110) 


N3 


= |000) + 


101) 


Ni 


= |000) + 


|011) 


N5 


= |000) + 


|011) + |101) 


Ne 


= |000) + 


1111) 



The first five are Pfaffian gates, and the last is Pfaffian in the Hadamard basis. 
After possibly an X change of basis, all are also realizable as junctions after 
some basis change. Considering these gates or junctions as the generators in a 
#CSP or circuit, we have the following. 

Theorem 6.4. All arity three predicates are Pfaffian, possibly after a free basis 
change. 

Corollary 6.5. In particular, all SLOCC equivalence classes of tripartite en- 
tangled qubits can be generated as Pfaffian predicates under free basis change. 

Of course, there are further restrictions required to obtain a Pfaffian circuit; 
these come into play when we want to compute with these states. For arity 
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four and above, there are already infinitely many orbits under arbitrary basis 
change and more sophisticated geometric techniques are needed to analyze the 
situation. Thus we postpone this analysis to future work. 
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